Label Semantic Knowledge Distillation for Unbiased Scene Graph Generation

نویسندگان

چکیده

The Scene Graph Generation (SGG) task aims to detect all the objects and their pairwise visual relationships in a given image. Although SGG has achieved remarkable progress over last few years, almost existing models follow same training paradigm: they treat both object predicate classification as single-label problem, ground-truths are one-hot target labels. However, this prevalent paradigm overlooked two characteristics of current datasets: 1) For positive samples, some specific subject-object instances may have multiple reasonable predicates. 2) negative there numerous missing annotations. Regardless characteristics, easy be confused make wrong predictions. To end, we propose novel model-agnostic Label Semantic Knowledge Distillation (LS-KD) for unbiased SGG. Specifically, LS-KD dynamically generates “soft" label each instance by fusing predicted Distribution (LSD) with its original label. LSD reflects correlations between categories. Meanwhile, different strategies predict LSD: iterative self-KD synchronous self-KD. Extensive ablations results on three tasks attested superiority generality our proposed LS-KD, which can consistently achieve decent trade-off performance

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-label Semantic Scene Classification

In classic pattern recognition problems, classes are mutually exclusive by definition. Classification errors occur when the classes overlap in the feature space. We examine a different situation, occurring when the classes are, by definition, not mutually exclusive. Such problems arise in semantic scene and document classification and in medical diagnosis. We present a framework to handle such ...

متن کامل

Scene Graph Generation from Images

Image understanding by computer is advancing exponentially these days due to the phenomenal success of deep learning, but there is still much work left for the computers to reach human level perception. Image classification (sometimes with localization) is one of the standard task, but this is far from the image understanding. The other tasks such as image caption generation or visual question ...

متن کامل

Scene Graph Generation by Belief RNNs

Understanding and describing the scene beyond an image has great value. It is a 1 step forward from recognizing individual objects and their relationships in isolation. 2 In addition, Scene graph is a crucial step towards a deeper understanding of a visual 3 scene. We present a method in an end-to-end model that given an image generates 4 a scene graph including nodes that corresponds to the ob...

متن کامل

Semantic Parsing for Text to 3D Scene Generation

We propose text-to-scene generation as an application for semantic parsing. This is an application that grounds semantics in a virtual world that requires understanding of common, everyday language. In text to scene generation, the user provides a textual description and the system generates a 3D scene. For example, Figure 1 shows the generated scene for the input text “there is a room with a c...

متن کامل

Knowledge Base Population using Semantic Label Propagation

A crucial aspect of a knowledge base population system that extracts new facts from text corpora, is the generation of training data for its relation extractors. In this paper, we present a method that maximizes the effectiveness of newly trained relation extractors at a minimal annotation cost. Manual labeling can be significantly reduced by Distant Supervision (DS), which is a method to const...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology

سال: 2023

ISSN: ['1051-8215', '1558-2205']

DOI: https://doi.org/10.1109/tcsvt.2023.3282349